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Abstract —Network switches and routers need to serve packet 
writes and reads at rates that challenge the most advanced 
memory technologies. As a result, scaling the switching rates is 
commonly done by parallelizing the packet I/Os using multiple 
memory units. For improved read rates, packets can be coded 
with an [n,k] MDS code, thus giving more flexibility at read 
time to achieve higher utilization of the memory units. In the 
paper, we study the usage of [n,k] MDS codes in a switching 
environment. In particular, we study the algorithmic problem 
of maximizing the instantaneous read rate given a set of packet 
requests and the current layout of the coded packets in memory. 
The most interesting results from practical standpoint show how 
the complexity of reaching optimal read rate depends strongly 
on the writing policy of the coded packets. 

I. Introduction 

Ever increasing demand for network bandwidth pressures 
switch and router vendors to scale their products at a fast 
pace. The most crucial component for throughput scaling is 
the memory sub-system that comprises the switching fabric. 
As it becomes extremely difficult or prohibitively costly to 
scale the read and write rates of memory units (MUs), an 
alternative solution of choice is to deploy multiple such 
units in parallel. Memory contention in switches occurs when 
multiple packets requested for read at a given instant happen 
to reside in the same memory unit. Because every memory 
unit can deliver a single chunk of data per time instant, this 
contention will result in a loss of switching throughput (while 
at the same time a different memory unit will be idle). Our 
ability to avoid such contention by clever packet placement 
is limited by the fact that the reading schedule of packets is 
not known upon arrival of the packets to the switch. 

This issue has driven switch vendors to seek methods to 
reduce memory contention and thus increase the switching 
throughput. One particularly promising technique is to intro¬ 
duce redundancy to the packet-write path, such that upon read 
the switch controller will enjoy greater flexibility to read the 
requested packets from memory units not contended by other 
read requests. This redundancy is introduced in the form of 
coding, whereby additional coded chunks are calculated from 
the incoming packet and written along with it in the switch 
memory. The simplest scheme of coding applied to packets 
is replication, where the additional chunks written with the 
packet are simply copies of the same packet. The advantage of 
this scheme is in its simplicity, requiring only trivial encoding 
and decoding. A more advanced packet-coding scheme uses 
[n^k] maximum distance separable (MDS) codes 111. This 
coding scheme takes an input of k packet chunks and encodes 
them into a codeword of n chunks (k < n), where any 


k chunks taken from the n code chunks can be used for 
the recovery of the original k packet chunks. This maximal 
flexibility in retrieving the packet makes MDS codes very 
well suited to use in switch memories. Latency comparison 
between replication and MDS codes was pioneered by Huang 
et al. El. It was shown that for k = 2, the average latency 
for serving a packet decreases significantly when a certain 
scheduling model is used. This analysis was later extended by 
Shah et al. in n d, where bounds on latency performance 
under multiple scheduling policies were investigated. 

In this paper, we provide a model of a coded switch that 
considers the number of memory units in use and the code 
parameters. Then we put our focus on maximizing and ana¬ 
lyzing the throughput of the switch. The notion of throughput 
we pursue here is the active MUs serving packets out of the 
MUs in the switch. Increasing the number of packets in the 
system (i.e., the load) gives more choice to the reader, and is 
thus expected to improve the throughput. Eor the probabilistic 
analysis of throughput we use a static distribution on the 
requested-packets’ locations (governed by the write policy), 
and assume that the switch observes this static distribution in 
steady state. The problem of achieving maximal throughput is 
formulated in two equivalent ways as problems in set theory 
and in graph theory. These formulations allow us later to 
obtain insights, algorithms, and bounds for the problem. 

This paper is structured as follows. In Section |I^ we 
provide complexity analysis of the throughput maximization 
problem, where in Section we discuss suboptimal algo¬ 
rithms and bounds on the optimal solution. We then provide 
a structured version of the problem in Section W and show 
that it admits a polynomial-time solution. Einally, conclusions 
are given in Section |V| 


H. Problem Eormulation and Complexity 

Consider a scenario in which multiple packets are stored in 
MUs, to be later forwarded by a switch. Let us assume that 
each packet consists of k chunks, which are MDS-encoded 
into n chunks. The n chunks are then stored in n MUs out of 
N available ones (k < n < N), where overlapping is allowed 
(i.e., two or more packets may share one or more MUs). Out 
of many packets currently stored in the switch memory, a 
request arrives for L packets, with the objective to read as 
many out of these packets in a single time instant. Recall 
that k chunks out of the n encoded ones are sufficient for 
recovering a packet. We know in which MUs these L packets 
are stored, and wish to find methods for reading as many 
packets as possible simultaneously, with the constraint that 
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Fig. 1: Illustration of nkMTP. The patterned rectangles rep¬ 
resent encoded data chunks (n = 3). 


each MU can be accessed only once in the reading process. 
Let us denote by L* the maximal number of packets that can 
be read. We consider the following notion of throughput as 
a performance measure. 

Definition 1: (Throughput) 

The throughput p of the system is defined as: 


We name the problem of maximizing the throughput p as the 
[n,k\-maximal throughput problem, or nkMTP. An instance 
of the problem is illustrated in Figure where data chunks 
of multiple packets are shown on top of MUs storing them. 

The motivation for using MDS codes in this setting rather 
than simple replication is demonstrated in Figure Here we 
compare the use of a [4,2] MDS code for encoding 2 chunks 
of a packet to 2 uses of a [ 2 , 1 ] repetition code (i.e., 2 -way 
replication), one use for each chunk. To read a packet when 
the repetition code is used, the reader needs to read one chunk 
from a specific subset of 2 MUs out of the 4 MUs storing 
the packet, and an additional chunk from the other subset of 
2 MUs. On the other hand, when a [4,2] MDS code is used, 
the reader can recover the packet by reading two chunks from 
any 2 MUs out of the 4 MUs storing the packet chunks. 
The resulting curves in Figure show that on average the 
[4,2] MDS code allows reading more simultaneous packets 
than the [2,1] repetition code. Note that both schemes write 4 
chunks per packet, and hence this advantage comes without 
increasing the write load. 

Evidently, the way packets are stored dictates how they 
should be read for optimal results. In particular, each 
write/read scheme combination may result in different num¬ 
ber of packets that can be read. In this work, we will consider 
two write schemes that are easy to implement, and their 
optimally matched read schemes. For analysis purposes, we 
provide in the rest of this section two equivalent formulations 
of nkMTP. Consider the N available MUs as the elements of 
the set S = {1, 2 ,..., A^}. Each packet i = 1 , 2 ,..., L is stored 
in MUs indexed by a subset Si of S, where \Si\ = n and the 
subsets may overlap. The set theory formulation of nkMTP 
is as follows. 

Problem 1: (nkMTP, set theory formulation) 

Input: Set S = {1, 2,..., A^} and L subsets of S, Si C S, 
such that |5'^| = n. 



Fig. 2: Comparison between a [4, 2] MDS code and two uses 
of the [2,1] repetition code (N = 16). 


Output: Subsets S- ^ Si, \Sl\ = k, S[ D S'' = 0 (i ^ j), 
such that the number of subsets is maximal. 

Example 1: 

A^ = 5,I/ = 3,n = 3. The packets are stored in the MUs 
indexed by the sets Si = { 1 , 2 , 3 },S 2 = { 2 ,4,5}, S 3 = 
{3,4,5}. If /c = n = 3, we have that L* = 1 and the 
recovered packet can be either 1,2 or 3 since Si D Sj 7 ^ 0 
for all = 1,2,3. If /c = 2, a possible solution is 
S^ = {1, 2} , S 2 = {4, 5} with L* = 2. Note that no more 
than 2 packets can be read in this case, since L* < \_N/k\ = 
2. Finally, if k = 1 all the packets can be read, and one 
possible solution is S^ = {1} , S 2 = {2} , S 3 = {3}. 

In addition to the set theory formulation, nkMTP can 
be formulated equivalently on a graph. Consider a bipartite 
graph G = (Vg^Eg), where Vg denotes the vertices of G 
and Eg denotes the edges of G. In addition, let us denote by 
deg{v) the degree of a vertex v e Vg- Since G is bipartite, 
Vg can be partitioned into two disjoint sets of vertices, let us 
denote them by Xg and Yg- Thinking of Xg as packets, and 
of Yg as MUs, vertex i in Xg will be connected to vertex j in 
Yg if one of the encoded chunks of packet i is stored in MU j. 
The resulting graph has the following properties: deg(x) = n 
(fix G Xg). IATgI = L, \Yg\ = N, and \Eg\ = nL. nkMTP 
can be now formulated as follows. 

Problem 2: (nkMTP, graph theory formulation) 

Input: Graph representation G of nkMTP. 

Output: Subsets Xh C Xg.Yh C Yg.Eh C Eg with 
the properties deg(x) = kfieg{y) G {0,1} (Vx G Xn.y G 
Yh). such that \Xh\ is maximal. 

For instance. Example is represented as the graph in 
Eigure The output of Problem is essentially a subgraph 
H of G, providing a solution to nkMTP with throughput 
p=\XH\k/N. 

A. Complexity 

In this subsection, we analyze the computational complex¬ 
ity of nkMTP. Clearly, a simple approach for solving nkMTP 
is to consider all possible assignments of MUs to packets, and 
to choose the assignment leading to the maximal number of 
packets that can be read. However, this approach is clearly 



































Fig. 3: nkMTP from Example formulated on a graph. 


inefficient since its complexity scales exponentially in L. In 
fact, nkMTP can be solved in polynomial time if /c = 1, n > 1 
(i.e., each packet consists of one chunk and the repetition code 
is used) or k = n = 2. On the other hand, nkMTP is NP-hard 
for 3 < k < n. 

Theorem 1: For k = l,n> lor/c = n = 2, nkMTP is 
solvable in polynomial time. 

Proof When /c = 1, n > 1, nkMTP is equivalent to finding a 
subgraph iif of G (a graph representation of nkMTP) that 
is a maximum bipartite matching Q, i.e., containing the 
largest number of matched pairs (x^y), x G Xq^V G Yq, 
such that each pair is connected by an edge and the edges 
are pair-wise non-adjacent. When k = n = 2, consider the 
N MUs as the vertices of a (uni-partite) graph, where an 
edge in this graph connects two MUs shared by the same 
packet. A maximum matching in this graph will provide the 
largest number of disjoint pairs of MUs, each pair serving a 
packet, corresponding to an optimal solution of the nkMTP 
instance. Efficient algorithms are known for finding maximum 
matching in both cases IS. | 

Theorem 2: nkMTP is NP-hard for 3 < k < n. 

To prove Theorem we reduce the /-set packing (/-SP) 
problem || 6 l, known to be NP-hard, to nkMTP. In /-SP, there 
are L sets, each of size /, and the problem is to find the 
maximal number of pair-wise disjoint sets. The details of 
the reduction are provided in Appendix The consequence 
of the hardness result of Theorem |2] is that no efficient 
algorithms are expected to be found for solving nkMTP when 
3 < k < n. However, in the next sections we provide 
algorithmic and analytic results that help solving the nkMTP 
problem in practical settings. We will also see variants of 
nkMTP for which we do find polynomial-time algorithms. 

III. Probabilistic Analysis and Bounds 

The fact that nkMTP turns out to be NP-hard for interesting 
coding parameters is important theoretically, but should not 
discourage one from seeking high-throughput coded switch¬ 
ing. In this section we provide tools that will help find good 
coded-switching solutions in a practical setup. 

A. Lower bounding the maximal solution 

In this sub-section, we provide a lower bound on the num¬ 
ber of packets that can be read, for a given nkMTP instance. 
Consider the following randomized algorithm, applied to a 
graph formulation of nkMTP (Problem [^: 

Algorithm 1: 

1) Calculate the degree of each MU. 


2) Assign each MU independently at random to one of its 
connected packets with probability 1/d, where d is the 
MU degree. 

3) Return the set of packets having at least k connected 
MUs. 

The packets in the set returned by Algorithm can be read, 
since at least k chunks are available for each packet. We 
now turn to calculate the expected size of the set returned 
by Algorithm which will be shown to serve as a lower 
bound on L*. Denote by li an indicator random variable that 
equals 1 if packet i was assigned at least k MUs, and equals 
0 otherwise. In addition, define auxiliary indicator random 
variables li^, which equals 1 if MU j from the set of MUs 
connected to packet i (i.e., j e Si, where Si are defined in 
Section 0 ends up connected to packet i. Thus: 


Pr(;i = l)=Pr I 

KjeSi 


( 2 ) 


Denote by dij the degree of MU j connected to packet 
i. lij are Bernoulli random variables whose success proba¬ 
bilities are 1/di^j. These random variables are independent 
for the same i and different j, but not identically distributed 
(since the MU degrees may vary). The distribution of these 
random variables is called Poisson binomial distribution m 
A closed-form expression for the right-hand side of Equation 
denoted Q(i), is obtained using (Tl: 


qw = i-E 

s=0 


Vk-1 


g-i27rst/(n+l) 

_t=0 


(3) 


• n [iMdij) • + (1 - l/dij) 

jeSi 


/ (n + 1). 


We are now ready to provide a lower bound on L*. 

Theorem 3: For a given nkMTP instance, the number of 
packets that can be read is lower-bounded as follows: 


L* > 


L 




(4) 


Proof Denote by I the cardinality of the set returned by 
Algorithm [^ / is a random variable and its expected value 
(over realizations of sets provided by Algorithm is: 


E[I]=E 

L 

= Eq(*)- 




L L 


Y,E[k]=Y,^r{k = l) 

i=l i=l 


(5) 


Since E [I] is an expected value, there must exist a valid so¬ 
lution to nkMTP with cardinality at least E[I]. The existence 
of such set leads to the lower bound 0- I 

Example 2: 

Consider the nkMTP instance of Figure The MU degrees 
are = 1, = 2, = 2 and so on. The lower bounds 

on L*, obtained using 0, are 2.75, 1.75 and 0.5 for k = 1, 
k = 2 and k = 3, respectively, where the corresponding L* 
values are 3, 2 and 1. 











B. Expected performance of nkMTP ensembles 

In this sub-section, we analyze the nkMTP in a random 
setting, where we consider random ensembles of nkMTP 
instances with fixed parameters k^n,N and L. Assuming a 
graph formulation (Problem [^, an instance taken from an 
ensemble consists of a graph G with L packets and N MUs, 
where the n chunks of each packet are stored independently 
and uniformly at random at n MUs. For each ensemble, we 
would like to estimate the probability of maximum through¬ 
put, i.e. the existence of a solution in form of a subgraph H 
such that Xh = Xq- The method is to identify a condition for 
the existence of such a solution, derived from the following 
extension of Hall’s theorem. 

Theorem 4: (Extended HalTs Theorem Consider a bi¬ 
partite graph G = {Xq^ Eg). Then, G satisfies deg(x) = 
k and deg(^) G {0,1} for all x G Xq and all y G Yq, if and 
only if for every subset W of Xq, 

\T{W)\>k\W\, ( 6 ) 

where T (W) is the set of vertices (in Yq) adjacent to the 
vertices in W. 

That is, the extended Hall’s theorem provides a necessary 
and sufficient condition to determine whether MUs in Yq can 
be assigned to packets in Xq, such that the degree of each 
vertex in Xq will be k. This is equivalent to the existence of 
a maximum throughput solution in G (i.e., each packet can 
be served). In the sequel we say that HalTs condition holds 
for a given subset W of Xq if the condition ([^ of Theorem 
[4] holds. 

Denote by = Um (|^|:^;^) the probability that the 
union of |IU| sets, each containing n elements taken indepen¬ 
dently and uniformly at random from the set { 1 , 2,..., A^}, 
results in a set of size m (m = 1,2, ...,A^). An expression 
for Um was derived in a previous work by us ||9l, based 
on a Markov model that is an extension of the balls-and- 
bins model Go). For a given subset W of Xq, denote by 
Pw the probability that the number of neighbours of W is 
greater than or equal to k\W\ (meaning that Hall’s condition 
holds for W). Since each vertex in W is of degree n, Pw 
is equivalent to the probability that the random union of |IU| 
sets of size n (the number of MUs that store each packet) 
results in a set of size that is greater than or equal to k\W\. 
Thus, 

N 

Pw= Y. Um{\W\-,n,N). ( 7 ) 

m=k\W\ 

Note that Pw from 0 is independent of the particular choice 
of W. Instead, it depends on the size of W, i.e., Pw = P\w\- 

Calculating the probability that all subsets W C Xq 
satisfy Hall’s condition is a difficult task, due to dependencies 
between the subsets. However, a necessary condition for 
an instance to contain a maximum throughput solution is 
that the set Xq satisfies Hall’s condition, which happens 
with probability P\Xa\- Therefore, an upper bound on the 
probability that an instance contains a maximum throughput 
solution is simply P\Xg\- As P\Xg\ obtained by raising a 
Markov matrix to power \Xg\, its calculation is efficient. In 
Figure]^ we first calculate the probability that a random graph 
G contains a maximum throughput solution, by averaging 



Fig. 4: The probability that all \Xg\ packets of a random 
nkMTP instance can be served, as a function of k and n 
(N = 16). 

over the optimal solution sizes of 10 , 000 instances for A: = 2 
and varying values of n. We then compare this empirical 
average to the upper bound obtained using P\Xg\- The results 
show that the upper bound provided by P\Xg\ tight and 
captures the behaviour of the empirical average. The bound 
is tighter as n becomes larger compared to k, meaning that 
in such cases the probability that Hall’s condition is satisfied 
for all Xg subsets is dominated by the probability that 
the set Xg satisfies Hall’s condition. As expected, better 
average performance is achieved as n increases (i.e., when 
redundancy increases). It is also demonstrated that P\Xg\ - 0 
for \Xg I > [N/k\ , since the maximal number of packets that 
can be served is upper bounded by [N/ k \. 

Hall’s condition provides a convenient way for estimating 
expected performance of an ensemble, when concentrating on 
maximum throughput solution. This way, instead of estimat¬ 
ing the probability of maximum throughput empirically by 
averaging over the optimal solution sizes of many nkMTP in¬ 
stances (whose solution is hard in general, see Section [IFA| , 
we may resort to calculate P\Xg\ instead. This is especially 
useful for large values of n and L, for which a direct solution 
of nkMTP (e.g., by considering all possible assignments of 
MUs to packets), may be prohibitively complex. In addition, 
we obtained an efficient way for choosing parameters k and 
n and load L such that good performance is expected. 

IV. Polynomial-Time Solution for a Structured 
Variant of nkMTP 

Motivated by the hardness of the nkMTP problem proved 
above, we now turn to consider variants of the problem that 
can be solved efficiently. Our approach to make the problem 
easier to solve is by changing the packet writing policy upon 
packet arrival. We show in this section that, by a moderate 
restriction on the MUs chosen to store the n chunks of a 
packet, we turn the maximal-throughput read problem to a 
tractable one. 

To the nkMTP problem discussed above we add the 
constraint that the n encoded chunks of each packet are 
stored in n consecutive MUs. We will refer to this scheme as 


















































CnkMTP, for which we show that polynomial-time solution 
exists. We first sort the sets Si in a non-decreasing order of 
the maximal index of the MUs they contain, and w.l.o.g. we 
consider instances of CnkMTP in which the packets are sorted 
accordingly. We begin with an empty set of packets, denoted 
A, and an empty set of MUs, denoted U. The following 
algorithm solves CnkMTP for k < n. 

Algorithm 2: 

1) Initialize S[ = Si for i = 1,..., L. 

2) Set i := 1. 

3) If \S-\ > k, add i to A, and add the k lowest elements 
of Si to U. 

4) Remove all the elements added to O from all the sets 
Sj, for j > i. 

5) Set i:=i + l. If i>I/, stop. Otherwise, go to step 3. 

Theorem 5: The set of packets A and their corresponding 
MUs in U found by Algorithm are an optimal solution to 
CnkMTP. 

Proof The proof starts by observing that w.l.o.g the k MUs 
assigned to a read packet are consecutive. If not, we can 
always exchange MUs between packet j and packets j' > j 
to make the assignment consecutive. 

Now we prove that if \Sl\ > k in step 3, then i appears 
in the optimal solution. We prove by induction on i. Assume 
all packets 1,..., i — 1 can be chosen optimally according to 
Algorithmic Then we show that the i-th packet can be chosen 
in the same way. We assume by contradiction that |5'-| > k 
and there is no optimal solution that contains packet i. Then 
we look at the smallest j > i for which packet j appears in 
the optimal solution. Then from the fact that its k assigned 
MUs are consecutive it is possible to shift the assignment to 
the first MU index in S'-, and replace j by i in the optimal 
solution without affecting the selection of any j' > j. This 
is a contradiction. | 

The operations required in Algorithm [C are simple shifting, 
in addition to the sorting of the packets. Hence its running 
time is clearly polynomial. We also consider a special case 
of CnkMTP, in which the N MUs are partitioned into N/n 
blocks of size n (assuming that n is a divisor of N), where 
the encoded chunks of each packet are restricted to one of 
those blocks. We term this scheme as CnkMTP on blocks. A 
comparison between nkMTP and CnkMTP is given in Figure 
l^for /c = 3,n = 4 and N = W. This figure reveals a very 
interesting tradeoff between throughput and computational 
complexity. Adding structure to the write policy in CnkMTP 
results in some loss of throughput compared with the un¬ 
restricted nkMTP. However, this throughput can be attained 
efficiently, where for nkMTP it is intractable to reach the 
throughput efficiently for large problem instances. It is also 
shown that CnkMTP gives better throughput compared with 
an even more structured writing policy, CnkMTP on blocks. 

V. Conclusion 

In this paper, we anaylzed the fundamental limits of using 
MDS codes in a switching environment. We proved that in 
its most general form, the problem of obtaining maximum 
throughput for a set of requested packets is a hard problem. 
Therefore, we provided bounds and algorithmic tools to aid 



Fig. 5: Scheme comparison. 


its solution in practice. By a simple modification of the 
writing policy used by the switch, we have shown how the 
problem can be solved efficiently. Our work leaves many 
interesting problems for future research, most immediately 
how to tailor the switch and code parameters to match real- 
life network workloads. From a practical point of view, we 
currently investigate the performance of certain MDS codes in 
a switching environment, taking encoding/decoding overhead 
into account. 
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Appendix A 

Detailed proof of Theorem [2] 

To show hardness of nkMTP when 3 < k < n, we first 
define the decision-problem version of nkMTP, which we 
name M-nkMTP In the rest of this appendix, we assume 
that 3 < k < n. 

Problem 3: (M-nkMTP) 

Input: Set theory formulation (Problem of nkMTP and 
a positive integer M. 

Output: ”Yes” if there are M subsets S'- C Si with the 
properties |S-| = k, S[ H S' = 0 (i 7^ j). 

For showing that nkMTP is NP-hard we can equivalently 
show that M-nkMTP is NP-complete. Note that M-nkMTP 
is in NP, since once we are given a collection of M subsets 
S' C Si claimed to be pair-wise disjoint, this can validated in 
polynomial time. It remains to reduce a known NP-complete 
problem to M-nkMTP, meaning that we have to show that an 
efficient solution to M-nkMTP implies an efficient solution to 
this NP-complete problem. We will reduce the I-set packing 
problem (/-SP), known to be NP-complete for / > 3 l6l. to 
our problem. /-SP is defined as follows. 

Problem 4: (/-SP) 

Input: Collection of sets over a certain domain, each of 
them of size /, and a positive integer M. 

Output: ”Yes” if there are M pair-wise disjoint sets. 

Proof First, M-nkMTP is NP-complete for 3 < /c = n, since 
in this case M-nkMTP and /-SP, for I = k = n, are essentially 
the same. Therefore, it remains to reduce /-SP (/ > 3) to M- 
nkMTP for 3 < k < n. Let us begin with reducing /-SP to 
M-nkMTP with k = l^n = k 1. 

Consider an instance of /-SP with I = k, with M de¬ 
noting the number of pair-wise disjoint subsets required in 
the solution. Assume that the input to /-SP are L sets Ai 
(i = 1,2,...,!/), where the elements contained in Ai are 
[JAi = {ai, a2 ,Us}. For building an instance of M- 

i 

nkMTP with /c = /,n = /c + l, do the following: 

• Build sets Bi, each of size k, from s new elements 
{/)!,/)2 ,such that a one-to-one correspondence 
between the elements in Ai and the elements in Bi 
exists: Oj G hj G Bi. 

m Add a new element, say 0, which does not appear in the 
sets Ai or Bi, to each of the sets Ai and Bi. Denote the 
new sets by Ai and Bi. 

The input to M-nkMTP with n = k^l will be the sets Ai 
and Bi, where we will ask whether there exist 2M subsets 
of size k each that are pair-wise disjoint. If /-SP provides 
a solution of size M for the sets Ai, then clearly the sets 
Ai C Ai,Bi C Bi serve as solution of size 2M to M- 
nkMTP with n = /c + 1. On the other hand, if there exists 
a solution of size 2M in the M-nkMTP problem, we have 
three cases: 

1) M subsets A[ C Ai and M subsets B'- C Bi appear 
in the solution. The element 0 can appear in only one 
of the subsets, since they must be pair-wise disjoint. 
If 0 belongs to some A'-, then we have M subsets B[ 
that provide a solution to /-SP (after transforming the 
elements in B[ to the their corresponding elements in 


A'-). On the other hand, if 0 belongs to some B'-, then 
the solution is the sets A'. 

2) Ml subsets A' C Ai and M2 subsets B[ C Bi appear 
in the solution, where Mi < M2 and Mi + M2 = 
2M. 0 can appear in at most one of the subsets B[. In 
addition, M < M2, and therefore choosing the subsets 
B[ that do not contain 0 leads to a solution of /-SP 
with at least M subsets (again, transformation to the 
elements of Ai is required). 

3) Ml subsets B'- C Bi and M2 subsets A' C Ai appear 
in the solution, where Mi < M2 and Mi + M2 = 2M. 
A solution of size at least M to /-SP is obtained in a 
similar way to the previous case. 

The transformation A^ ^ A^, Bi is clearly polynomial in 
L, since it merely requires to build L sets of size k and to 
add one element to each of the resulting 2L sets. Thus, the 
reduction described above is a polynomial time reduction. 
Therefore, M-nkMTP is NP-complete for k >3,n = k ^ 1, 
and it remains to show that M-nkMTP is NP-complete for 
k >3,n > k ^ 1. 

Consider M-nkMTP with k > 3,n = k-\-2. We can reduce 
M-nkMTP with k > 3, n = k f (which we proved to be 
NP-complete) to M-nkMTP with k > 3,n = k-\-2, similarly 
to the reduction of /-SP to M-nkMTP with k = l,n = k 1 
that was described earlier. Continuing in the same fashion, we 
are able to reduce M-nkMTP with n = k^ j (k >3, j > 1) 
to M-nkMTP with n = /c+j +1. Finally, we deduce that M- 
nkMTP is NP-complete for 3 < k < n, meaning that nkMTP 
(the optimization version of M-nkMTP) is NP-hard. | 


